AITopics

2604.2326

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.94)

López-Montero, Daniel, Álvarez-López, Antonio, Matabuena, Marcos

Gaussian mixture models in Hilbert spaces via kernel methods

arXiv.org Machine LearningMay-8-2026

Modern datasets across many disciplines increasingly consist of time-evolving, potentially infinite-dimensional random objects, such as dynamic functional data, which are naturally modeled in Hilbert spaces. In these settings, characterizing probability measures, for example, through densities, can be ill-defined or technically challenging. Motivated by clustering applications, we propose a Gaussian mixture framework for Hilbert-space-valued data based on kernel mean embeddings and develop efficient optimization algorithms for estimation. We establish theoretical guarantees showing that the proposed algorithm is well defined and that the model yields a dense class of approximations in infinite-dimensional spaces. We evaluate the framework through extensive experiments on diverse structures and data geometries, including $L^2$-functional data and random graphs in Laplacian spaces arising in modern medical applications.

artificial intelligence, hilbert space, machine learning, (19 more...)

2605.05996

Country: Europe > Germany > Bavaria > Middle Franconia > Nuremberg (0.40)

Genre: Research Report > Experimental Study (1.00)

Industry:

Health & Medicine > Health Care Technology (0.93)
Health & Medicine > Therapeutic Area > Endocrinology > Diabetes (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (0.66)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.46)

Bark, Stephan, Malik, Waqas Ahmed, Prus, Maryna, Piepho, Hans-Peter, Schmid, Volker

A Bayesian Updating Framework for Long-term Multi-Environment Trial Data in Plant Breeding

arXiv.org Machine LearningApr-20-2026

In variety testing, multi-environment trials (MET) are essential for evaluating the genotypic performance of crop plants. A persistent challenge in the statistical analysis of MET data is the estimation of variance components, which are often still inaccurately estimated or shrunk to exactly zero when using residual (restricted) maximum likelihood (REML) approaches. At the same time, institutions conducting MET typically possess extensive historical data that can, in principle, be leveraged to improve variance component estimation. However, these data are rarely incorporated sufficiently. The purpose of this paper is to address this gap by proposing a Bayesian framework that systematically integrates historical information to stabilize variance component estimation and better quantify uncertainty. Our Bayesian linear mixed model (BLMM) reformulation uses priors and Markov chain Monte Carlo (MCMC) methods to maintain the variance components as positive, yielding more realistic distributional estimates. Furthermore, our model incorporates historical prior information by managing MET data in successive historical data windows. Variance component prior and posterior distributions are shown to be conjugate and belong to the inverse gamma and inverse Wishart families. While Bayesian methodology is increasingly being used for analyzing MET data, to the best of our knowledge, this study comprises one of the first serious attempts to objectively inform priors in the context of MET data. This refers to the proposed Bayesian updating approach. To demonstrate the framework, we consider an application where posterior variance component samples are plugged into an A-optimality experimental design criterion to determine the average optimal allocations of trials to agro-ecological zones in a sub-divided target population of environments (TPE).

artificial intelligence, machine learning, variance component, (19 more...)

2604.16203

Country:

Europe > Germany (0.14)
Asia > Bangladesh (0.04)
North America > United States > New York (0.04)
Europe > Netherlands (0.04)

Genre: Research Report > Experimental Study (0.40)

Industry: Food & Agriculture > Agriculture (0.48)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)

Bardone, Lorenzo, Merger, Claudia, Goldt, Sebastian

A theory of learning data statistics in diffusion models, from easy to hard

arXiv.org Machine LearningMar-16-2026

While diffusion models have emerged as a powerful class of generative models, their learning dynamics remain poorly understood. We address this issue first by empirically showing that standard diffusion models trained on natural images exhibit a distributional simplicity bias, learning simple, pair-wise input statistics before specializing to higher-order correlations. We reproduce this behaviour in simple denoisers trained on a minimal data model, the mixed cumulant model, where we precisely control both pair-wise and higher-order correlations of the inputs. We identify a scalar invariant of the model that governs the sample complexity of learning pair-wise and higher-order correlations that we call the diffusion information exponent, in analogy to related invariants in different learning paradigms. Using this invariant, we prove that the denoiser learns simple, pair-wise statistics of the inputs at linear sample complexity, while more complex higher-order statistics, such as the fourth cumulant, require at least cubic sample complexity. We also prove that the sample complexity of learning the fourth cumulant is linear if pair-wise and higher-order statistics share a correlated latent structure. Our work describes a key mechanism for how diffusion models can learn distributions of increasing complexity.

artificial intelligence, cit, machine learning, (18 more...)

2603.12901

Country:

North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
Europe > Italy > Friuli Venezia Giulia > Trieste Province > Trieste (0.04)
Europe > France > Hauts-de-France > Nord > Lille (0.04)

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)

Georgios Arvanitidis, Lars K. Hansen, Søren Hauberg

A Locally Adaptive Normal Distribution

Neural Information Processing SystemsFeb-18-2026, 21:01:05 GMT

The underlyingmetricis,however,non-parametric.Wedevelopamaximumlikelihood algorithm to infer the distribution parameters that relies on a combination of gradient descent and Monte Carlo integration. We further extend the LAND to mixture models, andprovidethecorresponding EMalgorithm.

artificial intelligence, machine learning, manifold, (18 more...)

Country:

North America > United States > New Jersey > Hudson County > Secaucus (0.04)
North America > United States > Massachusetts > Suffolk County > Boston (0.04)
Europe > United Kingdom > England > Tyne and Wear > Sunderland (0.04)
(2 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Neural Information Processing SystemsFeb-9-2026, 19:58:31 GMT

8ccfb1140664a5fa63177fb6e07352f0-Supplemental.pdf

A.1 Notationandpreliminaries We consider the metric space(X,d( , )) where d : X X R+. We consider`( , ) to be the cross-entropyloss `log(M(x),y), ylogσ(M(x)) (1 y)logσ((1 M(x))) (whereσ(x) = 11+exp( x) isthe sigmoid function) orthe`2 loss.

artificial intelligence, exp 1 2, machine learning, (16 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.87)

Neural Information Processing SystemsFeb-8-2026, 19:16:48 GMT

6c81c83c4bd0b58850495f603ab45a93-Supplemental.pdf

We first consider the dynamical partition function, correspondingtoEq.

artificial intelligence, machine learning, ynew, (17 more...)

Country:

Europe > Switzerland (0.05)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
Europe > France > Île-de-France > Paris > Paris (0.04)
Asia > Singapore (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.47)

Neural Information Processing SystemsFeb-8-2026, 17:25:57 GMT

SupplementaryMaterial MatrixCompletionwithHierarchical GraphSideInformation

This implies that M(δ) = T(δ), i.e., the constraint(13) made in T(δ) does not lose any generality in matrix representation. One technical distinction relative to the previous works [2,3] arises from the fact that in our setting, the hamming distances(dx1(`),dx2(`),dx3(`)) defined w.r.t. We focus on the family of rating matrices{Mhci: c T`}. First, we present the following lemma that guarantees the existence of two subsets of users with certainproperties. The proof of this case follows the same structure as that of the grouping-limited regime. It is shown that the groups within each cluster are recovered with a vanishing fraction of errors if Ig = ω(1/n).

artificial intelligence, log 1, machine learning, (18 more...)